Menu Top
Non-Rationalised NCERT Books Solution
6th 7th 8th 9th 10th 11th 12th

Class 10th Chapters
1. Real Numbers 2. Polynomials 3. Pair of Linear Equations in Two Variables
4. Quadratic Equations 5. Arithmetic Progressions 6. Triangles
7. Coordinate Geometry 8. Introduction to Trigonometry 9. Some Applications of Trigonometry
10. Circles 11. Constructions 12. Areas Related to Circles
13. Surface Areas And Volumes 14. Statistics 15. Probability

Content On This Page
Example 1 to 3 (Before Exercise 14.1) Exercise 14.11 Example 4 to 6 (Before Exercise 14.2)
Exercise 14.2 Example 7 & 8 (Before Exercise 14.3) Exercise 14.3
Example 9 (Before Exercise 14.4) Exercise 14.4


Chapter 14 Statistics

This solutions guide for Chapter 14: Statistics ventures into the essential techniques for analyzing data that has been organized into groups or classes, known as grouped data. While previous studies might have introduced basic data handling, this chapter significantly deepens our understanding by focusing on calculating the primary measures of central tendency – the mean, median, and mode – specifically adapted for situations where individual data points are aggregated into intervals. When dealing with large datasets, presenting raw data is often impractical and fails to reveal underlying trends or typical values. Grouping data into frequency distributions provides a concise summary, but it also necessitates specialized methods to estimate central tendency measures, as the exact values within each class interval are unknown. This chapter provides the statistical toolkit required for this estimation. Furthermore, we explore the power of graphical representation through cumulative frequency distributions and their corresponding curves, known as ogives, which offer visual insights into data distribution and provide a graphical method for estimating the median. Mastering these methods allows for a more sophisticated interpretation and comparison of datasets frequently encountered in various fields like economics, biology, social sciences, and quality control. The techniques presented here build upon foundational concepts, equipping learners with robust tools for descriptive statistical analysis.

Calculating the Mean of Grouped Data involves finding a representative value for the center of the distribution. Since individual data points are lost in grouping, we use the class mark ($x_i$), which is the midpoint of each class interval ($x_i = \frac{\text{Upper limit + Lower limit}}{2}$), as the representative value for all data points within that interval ($f_i$ represents the frequency of the $i$-th class). Three methods are detailed in the solutions:

Calculating the Mode of Grouped Data involves identifying the value that occurs most frequently. For grouped data, the mode lies within the class interval having the highest frequency, known as the modal class. The specific value is estimated using the formula: $\text{Mode} = l + \left[\frac{(f_1 - f_0)}{(2f_1 - f_0 - f_2)}\right] \times h$. Here, $l$ is the lower limit of the modal class, $h$ is the class size (assuming uniform width), $f_1$ is the frequency of the modal class, $f_0$ is the frequency of the class preceding the modal class, and $f_2$ is the frequency of the class succeeding the modal class. The solutions clearly guide the process of identifying the modal class from the frequency table and then substituting the relevant values into the formula.

Calculating the Median of Grouped Data requires finding the middle value of the distribution. This involves using cumulative frequencies. A cumulative frequency table is constructed (usually of the 'less than' type). The first step is to find the median class, which is the class interval whose cumulative frequency is greater than or equal to $\frac{n}{2}$, where $n = \Sigma f_i$ is the total frequency. The median is then calculated using the formula: $\text{Median} = l + \left[\frac{(\frac{n}{2} - cf)}{f}\right] \times h$. In this formula, $l$ is the lower limit of the median class, $n$ is the total number of observations, $cf$ is the cumulative frequency of the class preceding the median class, $f$ is the frequency of the median class, and $h$ is the class size. Solutions demonstrate constructing the cumulative frequency column and applying this formula step-by-step.

Finally, the chapter explores the Graphical Representation of Cumulative Frequency Distributions using curves called Ogives (pronounced 'oh-jives'). There are two types:

Solutions explain the construction process for both types. A key application of ogives is the graphical estimation of the median. The x-coordinate of the point where the 'less than' ogive and the 'more than' ogive intersect gives the median value. Alternatively, the median can be estimated from a single ('less than' type) ogive by locating the $\frac{n}{2}$ value on the vertical (cumulative frequency) axis, drawing a horizontal line to intersect the ogive, and then dropping a perpendicular to the horizontal axis; the x-coordinate of this point approximates the median. These comprehensive tools provide a robust framework for the analysis and interpretation of grouped statistical data.



Example 1 to 3 (Before Exercise 14.1)

Example 1. The marks obtained by 30 students of Class X of a certain school in a Mathematics paper consisting of 100 marks are presented in table below. Find the mean of the marks obtained by the students

Marks obtained ($x_i$) 10 20 36 40 50 56 6070 72 80 88 92 95
Number of students ($f_i$) 1 1 3 4 3 2 44 1 1 2 3 1

Answer:

Given:

The marks obtained by 30 students and their corresponding frequencies are given in the table.

Marks obtained are denoted by $x_i$ and the number of students by $f_i$.


To Find:

The mean of the marks obtained by the students.


Solution:

We will use the direct method to find the mean of the given data.

The formula for the mean ($\bar{x}$) using the direct method is:

$\bar{x} = \frac{\sum (f_i \times x_i)}{\sum f_i}$

First, let's prepare a table to calculate the sum of the products of marks and frequencies ($f_i \times x_i$).

Marks obtained ($x_i$) Number of students ($f_i$) $f_i \times x_i$
101$10 \times 1 = 10$
201$20 \times 1 = 20$
363$36 \times 3 = 108$
404$40 \times 4 = 160$
503$50 \times 3 = 150$
562$56 \times 2 = 112$
604$60 \times 4 = 240$
704$70 \times 4 = 280$
721$72 \times 1 = 72$
801$80 \times 1 = 80$
882$88 \times 2 = 176$
923$92 \times 3 = 276$
951$95 \times 1 = 95$
Total $\sum f_i$ $\sum (f_i \times x_i)$
30 $10+20+108+160+150+112+240+280 \ $$ +72+80 \ $$ +176+276+95 = 1779$

Sum of frequencies, $\sum f_i = 30$

Sum of the products of frequencies and marks, $\sum (f_i \times x_i) = 1779$

Now, substitute these values into the mean formula:

$\bar{x} = \frac{1779}{30}$

$\bar{x} = 59.3$

The mean marks obtained by the students is $59.3$.

Example 2. The table below gives the percentage distribution of female teachers in the primary schools of rural areas of various states and union territories (U.T.) of India. Find the mean percentage of female teachers by all the three methods discussed in this section.

Percentage of female teachers 15 - 25 25 - 35 35 - 45 45 - 55 55 - 65 65 - 75 75 - 85
Number of States/U.T. 6 11 7 4 4 2 1

Answer:

Given

The frequency distribution of the percentage of female teachers in primary schools of rural areas.


To Find

The mean percentage of female teachers using the Direct Method, Assumed Mean Method, and Step-deviation Method.


Solution

First, we calculate the class mark ($x_i$) for each class interval. The class mark is the midpoint of the interval.

$x_i = \frac{\text{Upper class limit} + \text{Lower class limit}}{2}$

We also find the sum of frequencies, $\sum f_i = 6 + 11 + 7 + 4 + 4 + 2 + 1 = 35$.


1. Direct Method

The formula for the mean is $\bar{x} = \frac{\sum f_i x_i}{\sum f_i}$. We create the following table:

Percentage of female teachers Number of States/U.T. ($f_i$) Class Mark ($x_i$) $f_i x_i$
15 - 25620120
25 - 351130330
35 - 45740280
45 - 55450200
55 - 65460240
65 - 75270140
75 - 8518080
Total $\sum f_i = 35$ $\sum f_i x_i = 1390$

Mean, $\bar{x} = \frac{1390}{35} = \frac{278}{7} \approx 39.71$.


2. Assumed Mean Method

The formula is $\bar{x} = a + \frac{\sum f_i d_i}{\sum f_i}$, where $a$ is the assumed mean and $d_i = x_i - a$. Let's choose the assumed mean $a=50$.

$f_i$ $x_i$ $d_i = x_i - 50$ $f_i d_i$
620-30-180
1130-20-220
740-10-70
45000
4601040
2702040
1803030
$\sum f_i = 35$ $\sum f_i d_i = -360$

Mean, $\bar{x} = 50 + \frac{-360}{35} = 50 - \frac{72}{7} = \frac{350-72}{7} = \frac{278}{7} \approx 39.71$.


3. Step-deviation Method

The formula is $\bar{x} = a + h\left(\frac{\sum f_i u_i}{\sum f_i}\right)$, where $u_i = \frac{x_i - a}{h}$ and $h$ is the class size. Here, $a=50$ and the class size $h=10$.

$f_i$ $x_i$ $d_i = x_i - 50$ $u_i = d_i/10$ $f_i u_i$
620-30-3-18
1130-20-2-22
740-10-1-7
450000
4601014
2702024
1803033
$\sum f_i = 35$ $\sum f_i u_i = -36$

Mean, $\bar{x} = 50 + 10 \times \left(\frac{-36}{35}\right) = 50 - \frac{360}{35} = 50 - \frac{72}{7} = \frac{278}{7} \approx 39.71$.


By all three methods, the mean percentage of female teachers is 39.71%.

Example 3. The distribution below shows the number of wickets taken by bowlers in one-day cricket matches. Find the mean number of wickets by choosing a suitable method. What does the mean signify?

Number of wickets 20 - 60 60 - 100 100 - 150 150 - 250 250 - 350 350 - 450
Number of bowlers 7 5 16 12 2 3

Answer:

Given

The frequency distribution of the number of wickets taken by bowlers in one-day cricket matches.


To Find

The mean number of wickets and to explain what the mean signifies.


Solution

The class intervals are of varying sizes (e.g., 60-20=40, 100-60=40, 150-100=50). Because the class sizes are not uniform, the step-deviation method is not suitable. The numerical values of $x_i$ and $f_i$ are large, so the Assumed Mean Method is a good choice to simplify calculations.

The formula for the mean by the Assumed Mean Method is $\bar{x} = a + \frac{\sum f_i d_i}{\sum f_i}$, where $a$ is the assumed mean and $d_i = x_i - a$.

Let's prepare the calculation table. Let's choose the assumed mean $a = 200$.

Number of wickets Number of bowlers ($f_i$) Class Mark ($x_i$) $d_i = x_i - 200$ $f_i d_i$
20 - 60740-160-1120
60 - 100580-120-600
100 - 15016125-75-1200
150 - 2501220000
250 - 3502300100200
350 - 4503400200600
Total $\sum f_i = 45$ $\sum f_i d_i = -2120$

First, let's correct the sum of frequencies: $\sum f_i = 7+5+16+12+2+3 = 45$.

Now, let's find the sum of $f_i d_i$: $\sum f_i d_i = -1120 - 600 - 1200 + 0 + 200 + 600 = -2920 + 800 = -2120$.

Now, we can calculate the mean:

$\bar{x} = a + \frac{\sum f_i d_i}{\sum f_i} = 200 + \frac{-2120}{45}$

$\bar{x} = 200 - \frac{424}{9} \approx 200 - 47.11$

$\bar{x} \approx 152.89$.


The mean number of wickets taken is approximately 152.89.

Significance of the Mean: The mean of 152.89 signifies that, on average, a bowler from this group has taken approximately 153 wickets in their one-day cricket career. It represents the central tendency of the data.



Exercise 14.1

Question 1. A survey was conducted by a group of students as a part of their environment awareness programme, in which they collected the following data regarding the number of plants in 20 houses in a locality. Find the mean number of plants per house.

Number of plants 0 - 2 2 - 4 4 - 6 6 - 8 8 - 10 10 - 12 12 - 14
Number of houses 1 2 1 5 6 2 3

Which method did you use for finding the mean, and why?

Answer:

Given

The frequency distribution of the number of plants in 20 houses.


To Find

The mean number of plants per house, and to state the method used with justification.


Solution

We will use the Direct Method because the numerical values of the frequencies ($f_i$) and the class marks ($x_i$) are small, which makes the direct calculation of the product $f_i x_i$ simple and straightforward.

The formula for the mean using the direct method is:

$\bar{x} = \frac{\sum f_i x_i}{\sum f_i}$

First, we create a table to find the class marks ($x_i$) and the product $f_i x_i$ for each class.

Number of plants Number of houses ($f_i$) Class Mark ($x_i$) $f_i x_i$
0 - 2111
2 - 4236
4 - 6155
6 - 85735
8 - 106954
10 - 1221122
12 - 1431339
Total $\sum f_i = 20$ $\sum f_i x_i = 162$

From the table, we have:

$\sum f_i = 20$

$\sum f_i x_i = 162$

Now, we can calculate the mean:

$\bar{x} = \frac{162}{20} = 8.1$


The mean number of plants per house is 8.1.

The Direct Method was used because the data values were small, making the calculations easy.

Question 2. Consider the following distribution of daily wages of 50 workers of a factory.

Daily wages (in $\textsf{₹}$) 500 - 520 520 - 540 540 - 560 560 - 580 580 - 600
Number of workers 12 14 8 6 10

Find the mean daily wages of the workers of the factory by using an appropriate method.

Answer:

Given

The frequency distribution of the daily wages of 50 workers.


To Find

The mean daily wages of the workers.


Solution

Since the numerical values of the class marks ($x_i$) are large, the Step-deviation Method is an appropriate choice to simplify the calculations.

The formula for the mean using the step-deviation method is:

$\bar{x} = a + h \left( \frac{\sum f_i u_i}{\sum f_i} \right)$

where $a$ is the assumed mean, $h$ is the class size, and $u_i = \frac{x_i - a}{h}$.

From the data, the class size $h = 520 - 500 = 20$. Let's choose the assumed mean $a = 550$.

We create the following table for our calculations:

Daily wages (in $\textsf{₹}$) Number of workers ($f_i$) Class Mark ($x_i$) $u_i = \frac{x_i - 550}{20}$ $f_i u_i$
500 - 52012510-2-24
520 - 54014530-1-14
540 - 560855000
560 - 580657016
580 - 60010590220
Total $\sum f_i = 50$ $\sum f_i u_i = -12$

From the table, we have:

$\sum f_i = 50$

$\sum f_i u_i = -24 - 14 + 0 + 6 + 20 = -12$

Now, we can calculate the mean:

$\bar{x} = 550 + 20 \left( \frac{-12}{50} \right)$

$\bar{x} = 550 - \frac{240}{50} = 550 - 4.8 = 545.2$


The mean daily wages of the workers of the factory is 545.20.

Question 3. The following distribution shows the daily pocket allowance of children of a locality. The mean pocket allowance is Rs 18. Find the missing frequency f.

Daily pocket allowance (in $\textsf{₹}$) 11 - 13 13 - 15 15 - 17 17 - 19 19 - 21 21 - 23 23 - 25
Number of children 7 6 9 13 f 5 4

Answer:

Given

The frequency distribution of daily pocket allowance.

The mean pocket allowance, $\bar{x} = \textsf{₹} 18$.

One frequency is missing, denoted by $f$.


To Find

The missing frequency, $f$.


Solution

We can use any method to solve this, but the Assumed Mean Method or Step-deviation Method can simplify the algebra. Let's use the Assumed Mean Method.

The formula for the mean is $\bar{x} = a + \frac{\sum f_i d_i}{\sum f_i}$, where $a$ is the assumed mean and $d_i = x_i - a$.

Let's choose the assumed mean $a = 18$.

We create the following table for our calculations:

Daily pocket allowance (in $\textsf{₹}$) Number of children ($f_i$) Class Mark ($x_i$) $d_i = x_i - 18$ $f_i d_i$
11 - 13712-6-42
13 - 15614-4-24
15 - 17916-2-18
17 - 19131800
19 - 21$f$202$2f$
21 - 23522420
23 - 25424624
Total $\sum f_i = 44+f$ $\sum f_i d_i = 2f - 40$

From the table, we find the sums:

$\sum f_i = 7+6+9+13+f+5+4 = 44+f$.

$\sum f_i d_i = -42 - 24 - 18 + 0 + 2f + 20 + 24 = 2f - 40$.

Now, we use the mean formula with the given mean $\bar{x} = 18$ and assumed mean $a=18$:

$18 = 18 + \frac{2f - 40}{44 + f}$

$18 - 18 = \frac{2f - 40}{44 + f}$

$0 = \frac{2f - 40}{44 + f}$

For the fraction to be zero, the numerator must be zero.

$2f - 40 = 0$

$2f = 40$

$f = 20$.


The missing frequency is 20.

Question 4. Thirty women were examined in a hospital by a doctor and the number of heartbeats per minute were recorded and summarised as follows. Find the mean heartbeats per minute for these women, choosing a suitable method.

Number of heartbeats per minute 65 - 68 68 - 71 71 - 74 74 - 77 77 - 80 80 - 83 83 - 86
Number of women 2 4 3 8 7 4 2

Answer:

Given

The frequency distribution of heartbeats per minute for 30 women.


To Find

The mean heartbeats per minute.


Solution

The class size is uniform ($h = 3$), and the class marks are decimals. The Step-deviation Method is a suitable choice to simplify the calculations.

The formula for the mean is $\bar{x} = a + h \left( \frac{\sum f_i u_i}{\sum f_i} \right)$, where $u_i = \frac{x_i - a}{h}$.

Let's choose the assumed mean $a = 75.5$ (the middle class mark).

We create the following table for our calculations:

Heartbeats per minute Number of women ($f_i$) Class Mark ($x_i$) $u_i = \frac{x_i - 75.5}{3}$ $f_i u_i$
65 - 68266.5-3-6
68 - 71469.5-2-8
71 - 74372.5-1-3
74 - 77875.500
77 - 80778.517
80 - 83481.528
83 - 86284.536
Total $\sum f_i = 30$ $\sum f_i u_i = 4$

From the table, we have:

$\sum f_i = 30$

$\sum f_i u_i = -6 - 8 - 3 + 0 + 7 + 8 + 6 = 4$.

Now, we can calculate the mean:

$\bar{x} = 75.5 + 3 \left( \frac{4}{30} \right)$

$\bar{x} = 75.5 + \frac{12}{30} = 75.5 + \frac{2}{5} = 75.5 + 0.4$

$\bar{x} = 75.9$.


The mean heartbeats per minute for these women is 75.9.

Question 5. In a retail market, fruit vendors were selling mangoes kept in packing boxes. These boxes contained varying number of mangoes. The following was the distribution of mangoes according to the number of boxes.

Number of mangoes 50 - 52 53 - 55 56 - 58 59 - 61 62 - 64
Number of boxes 15 110 135 115 25

Find the mean number of mangoes kept in a packing box. Which method of finding the mean did you choose?

Answer:

Given

The frequency distribution of the number of mangoes in packing boxes.


To Find

The mean number of mangoes per box and to state the method used with justification.


Solution

The given class intervals are discontinuous. However, we can calculate the class marks as usual. The class marks are 51, 54, 57, 60, 63. The difference between consecutive class marks is uniform ($h=3$). Since the frequencies are large, the Step-deviation Method is the most suitable method to simplify calculations.

The formula for the mean is $\bar{x} = a + h \left( \frac{\sum f_i u_i}{\sum f_i} \right)$, where $u_i = \frac{x_i - a}{h}$.

Let's choose the assumed mean $a = 57$ and the class size $h=3$.

We create the following table for our calculations:

Number of mangoes Number of boxes ($f_i$) Class Mark ($x_i$) $u_i = \frac{x_i - 57}{3}$ $f_i u_i$
50 - 521551-2-30
53 - 5511054-1-110
56 - 581355700
59 - 61115601115
62 - 642563250
Total $\sum f_i = 400$ $\sum f_i u_i = 25$

From the table, we have:

$\sum f_i = 400$

$\sum f_i u_i = -30 - 110 + 0 + 115 + 50 = 25$.

Now, we can calculate the mean:

$\bar{x} = 57 + 3 \left( \frac{25}{400} \right) = 57 + 3 \left( \frac{1}{16} \right)$

$\bar{x} = 57 + \frac{3}{16} = 57 + 0.1875 = 57.1875$.


The mean number of mangoes kept in a packing box is 57.19 (rounded to two decimal places).

The Step-deviation Method was chosen because the frequencies were large and the class marks were uniformly spaced, which simplifies the arithmetic calculations significantly.

Question 6. The table below shows the daily expenditure on food of 25 households in a locality

Daily expenditure (in $\textsf{₹}$) 100 - 150 150 - 200 200 - 250 250 - 300 300 - 350
Number of households 4 5 12 2 2

Find the mean daily expenditure on food by a suitable method.

Answer:

Given

The frequency distribution of daily expenditure on food for 25 households.


To Find

The mean daily expenditure on food.


Solution

The class size is uniform ($h = 50$) and the class marks are large. The Step-deviation Method is a suitable choice to simplify the calculations.

The formula for the mean is $\bar{x} = a + h \left( \frac{\sum f_i u_i}{\sum f_i} \right)$, where $u_i = \frac{x_i - a}{h}$.

Let's choose the assumed mean $a = 225$.

We create the following table for our calculations:

Daily expenditure (in $\textsf{₹}$) Number of households ($f_i$) Class Mark ($x_i$) $u_i = \frac{x_i - 225}{50}$ $f_i u_i$
100 - 1504125-2-8
150 - 2005175-1-5
200 - 2501222500
250 - 300227512
300 - 350232524
Total $\sum f_i = 25$ $\sum f_i u_i = -7$

From the table, we have:

$\sum f_i = 25$

$\sum f_i u_i = -8 - 5 + 0 + 2 + 4 = -7$.

Now, we can calculate the mean:

$\bar{x} = 225 + 50 \left( \frac{-7}{25} \right)$

$\bar{x} = 225 - (50 \times \frac{7}{25}) = 225 - (2 \times 7) = 225 - 14$

$\bar{x} = 211$.


The mean daily expenditure on food is 211.

Question 7. To find out the concentration of SO2 in the air (in parts per million, i.e., ppm), the data was collected for 30 localities in a certain city and is presented below:

Concentration of SO2 (in ppm) Frequency
0.00 - 0.04 4
0.04 - 0.08 9
0.08 - 0.12 9
0.12 - 0.16 2
0.16 - 0.20 4
0.20 - 0.24 2

Find the mean concentration of SO2 in the air.

Answer:

Given

The frequency distribution of the concentration of SO2 in the air for 30 localities.


To Find

The mean concentration of SO2.


Solution

Since the numerical values of the class marks are small decimals, the Direct Method is suitable and easy to apply.

The formula for the mean is $\bar{x} = \frac{\sum f_i x_i}{\sum f_i}$.

We create the following table for our calculations:

Concentration of SO2 (in ppm) Frequency ($f_i$) Class Mark ($x_i$) $f_i x_i$
0.00 - 0.0440.020.08
0.04 - 0.0890.060.54
0.08 - 0.1290.100.90
0.12 - 0.1620.140.28
0.16 - 0.2040.180.72
0.20 - 0.2420.220.44
Total $\sum f_i = 30$ $\sum f_i x_i = 2.96$

From the table, we have:

$\sum f_i = 30$

$\sum f_i x_i = 2.96$

Now, we can calculate the mean:

$\bar{x} = \frac{2.96}{30} \approx 0.09866...$


The mean concentration of SO2 in the air is approximately 0.099 ppm.

Question 8. A class teacher has the following absentee record of 40 students of a class for the whole term. Find the mean number of days a student was absent.

Number of days 0 - 6 6 - 10 10 - 14 14 - 20 20 - 28 28 - 38 38 - 40
Number of students 11 10 7 4 4 3 1

Answer:

Given

The frequency distribution of the number of days students were absent.


To Find

The mean number of days a student was absent.


Solution

The class intervals are of varying sizes, so the step-deviation method cannot be used. We will use the Assumed Mean Method to simplify calculations, as the direct method would involve larger products.

The formula for the mean is $\bar{x} = a + \frac{\sum f_i d_i}{\sum f_i}$, where $a$ is the assumed mean and $d_i = x_i - a$.

Let's choose the assumed mean $a = 17$.

We create the following table for our calculations:

Number of days Number of students ($f_i$) Class Mark ($x_i$) $d_i = x_i - 17$ $f_i d_i$
0 - 6113-14-154
6 - 10108-9-90
10 - 14712-5-35
14 - 2041700
20 - 28424728
28 - 383331648
38 - 401392222
Total $\sum f_i = 40$ $\sum f_i d_i = -181$

From the table, we have:

$\sum f_i = 40$

$\sum f_i d_i = -154 - 90 - 35 + 0 + 28 + 48 + 22 = -279 + 98 = -181$.

Now, we can calculate the mean:

$\bar{x} = 17 + \frac{-181}{40} = 17 - 4.525$

$\bar{x} = 12.475$.


The mean number of days a student was absent is 12.475.

Question 9. The following table gives the literacy rate (in percentage) of 35 cities. Find the mean literacy rate

Literacy rate (in %) 45 - 55 55 - 65 65 - 75 75 - 85 85 - 95
Number of cities 3 10 11 8 3

Answer:

Given

The frequency distribution of the literacy rate of 35 cities.


To Find

The mean literacy rate.


Solution

The class size is uniform ($h = 10$). We can use the Step-deviation Method to find the mean, as it will simplify the calculations.

The formula for the mean is $\bar{x} = a + h \left( \frac{\sum f_i u_i}{\sum f_i} \right)$, where $u_i = \frac{x_i - a}{h}$.

Let's choose the assumed mean $a = 70$.

We create the following table for our calculations:

Literacy rate (in %) Number of cities ($f_i$) Class Mark ($x_i$) $u_i = \frac{x_i - 70}{10}$ $f_i u_i$
45 - 55350-2-6
55 - 651060-1-10
65 - 75117000
75 - 8588018
85 - 9539026
Total $\sum f_i = 35$ $\sum f_i u_i = -2$

From the table, we have:

$\sum f_i = 35$

$\sum f_i u_i = -6 - 10 + 0 + 8 + 6 = -2$.

Now, we can calculate the mean:

$\bar{x} = 70 + 10 \left( \frac{-2}{35} \right) = 70 - \frac{20}{35} = 70 - \frac{4}{7}$

$\bar{x} = \frac{490 - 4}{7} = \frac{486}{7} \approx 69.43$.


The mean literacy rate is approximately 69.43%.



Example 4 to 6 (Before Exercise 14.2)

Example 4. The wickets taken by a bowler in 10 cricket matches are as follows:

2645021323

Find the mode of the data

Answer:

Given

The number of wickets taken by a bowler in 10 cricket matches are:

2, 6, 4, 5, 0, 2, 1, 3, 2, 3


To Find

The mode of the data.


Solution

The mode is the value that appears most frequently in a data set.

To find the mode, let's create a frequency table for the given data.

Number of wickets Tally Marks Frequency (Number of matches)
0|1
1|1
2|||3
3||2
4|1
5|1
6|1

From the frequency table, we can see that the number of wickets '2' has the highest frequency of 3.


Therefore, the mode of the data is 2.

Example 5. A survey conducted on 20 households in a locality by a group of students resulted in the following frequency table for the number of family members in a household:

Family size 1 - 3 3 - 5 5 - 7 7 - 9 9 - 11
Number of families 7 8 2 2 1

Find the mode of this data

Answer:

Given

A grouped frequency distribution of family size in 20 households.


To Find

The mode of the data.


Solution

To find the mode for grouped data, we use the formula:

Mode $= l + \left(\frac{f_1 - f_0}{2f_1 - f_0 - f_2}\right) \times h$

Step 1: Identify the modal class and related values.

The modal class is the class with the highest frequency. Looking at the table, the highest frequency is 8, which corresponds to the class interval 3 - 5.

From this, we get:

  • Lower limit of the modal class ($l$) = 3
  • Frequency of the modal class ($f_1$) = 8
  • Frequency of the class preceding the modal class ($f_0$) = 7
  • Frequency of the class succeeding the modal class ($f_2$) = 2
  • Class size ($h$) = 5 - 3 = 2

Step 2: Substitute the values into the formula.

Mode $= 3 + \left(\frac{8 - 7}{2(8) - 7 - 2}\right) \times 2$

Mode $= 3 + \left(\frac{1}{16 - 9}\right) \times 2$

Mode $= 3 + \left(\frac{1}{7}\right) \times 2$

Mode $= 3 + \frac{2}{7}$

Mode $= \frac{21+2}{7} = \frac{23}{7}$

Mode $\approx 3.286$.


The mode of this data is approximately 3.286.

Example 6. The marks distribution of 30 students in a mathematics examination are given in Table 14.3 of Example 1. Find the mode of this data. Also compare and interpret the mode and the mean.

Table 14.3

Class interval Number of students ($f_i$)
10 - 25 2
25 - 40 3
40 - 55 7
55 - 70 6
70 - 85 6
85 - 100 6
Total $\sum f_i = 30$

Answer:

Given

The frequency distribution of marks of 30 students.

From Example 1, the mean of this data was calculated to be 62.


To Find

1. The mode of the data.

2. Compare and interpret the mode and the mean.


Solution

1. Finding the Mode:

We use the formula: Mode $= l + \left(\frac{f_1 - f_0}{2f_1 - f_0 - f_2}\right) \times h$

First, we identify the modal class. The highest frequency is 7, which corresponds to the class interval 40 - 55.

From this, we get:

  • $l = 40$
  • $f_1 = 7$
  • $f_0 = 3$ (frequency of the class 25 - 40)
  • $f_2 = 6$ (frequency of the class 55 - 70)
  • $h = 55 - 40 = 15$

Substitute these values into the formula:

Mode $= 40 + \left(\frac{7 - 3}{2(7) - 3 - 6}\right) \times 15$

Mode $= 40 + \left(\frac{4}{14 - 9}\right) \times 15$

Mode $= 40 + \left(\frac{4}{5}\right) \times 15$

Mode $= 40 + (4 \times 3) = 40 + 12 = 52$.

The mode of the marks is 52.

2. Comparison and Interpretation:

We have:

Mean = 62 marks.

Mode = 52 marks.

Interpretation: The mode (52) indicates that the highest number of students scored marks in the range which has 52 as the central value. It represents the most frequent score range.

The mean (62) represents the average score of all 30 students. If the total marks were distributed equally among all students, each would get 62 marks.

The mean is greater than the mode. This suggests that while the most common group of scores is around 52, there are enough students with higher scores to pull the average up. The data is slightly skewed to the right (positively skewed).



Exercise 14.2

Question 1. The following table shows the ages of the patients admitted in a hospital during a year:

Age (in years) 5 - 15 15 - 25 25 - 35 35 - 45 45 - 55 55 - 65
Number of patients 6 11 21 23 14 5

Find the mode and the mean of the data given above. Compare and interpret the two measures of central tendency.

Answer:

Given:

A frequency distribution table of the ages of patients admitted to a hospital.

Age (in years) 5 - 15 15 - 25 25 - 35 35 - 45 45 - 55 55 - 65
Number of patients ($f_i$) 6 11 21 23 14 5

To Find:

The mode and mean of the data, followed by a comparison and interpretation of the results.


Solution:

Calculation for Mode

The formula for the mode of grouped data is:

Mode $= l + \left(\frac{f_1 - f_0}{2f_1 - f_0 - f_2}\right) \times h$

First, we identify the modal class, which is the class with the highest frequency. Here, the highest frequency is 23, corresponding to the class interval 35 - 45.

From the data, we have:

  • Lower limit of the modal class ($l$) = 35
  • Class size ($h$) = 10
  • Frequency of the modal class ($f_1$) = 23
  • Frequency of the preceding class ($f_0$) = 21
  • Frequency of the succeeding class ($f_2$) = 14

Substituting these values into the formula:

Mode $= 35 + \left(\frac{23 - 21}{2(23) - 21 - 14}\right) \times 10$

Mode $= 35 + \left(\frac{2}{46 - 35}\right) \times 10$

Mode $= 35 + \left(\frac{2}{11}\right) \times 10$

Mode $= 35 + \frac{20}{11} \approx 35 + 1.82$

Mode $\approx 36.82$

The mode of the data is approximately 36.82 years.

Calculation for Mean

We will use the Step-deviation method to find the mean. The formula is:

Mean $(\bar{x}) = a + \left(\frac{\sum f_i u_i}{\sum f_i}\right) \times h$

Let's create the calculation table. Let the assumed mean ($a$) be 30, and class size ($h$) is 10.

Age (in years) Number of patients ($f_i$) Class Mark ($x_i$) $u_i = \frac{x_i - 30}{10}$ $f_i u_i$
5 - 15610-2-12
15 - 251120-1-11
25 - 35213000
35 - 452340123
45 - 551450228
55 - 65560315
Total $\sum f_i = 80$ $\sum f_i u_i = 43$

From the table, $\sum f_i = 80$ and $\sum f_i u_i = (-12 - 11) + (23 + 28 + 15) = -23 + 66 = 43$.

Substituting these values into the formula:

$\bar{x} = 30 + \left(\frac{43}{80}\right) \times 10$

$\bar{x} = 30 + \frac{430}{80} = 30 + \frac{43}{8}$

$\bar{x} = 30 + 5.375$

$\bar{x} = 35.375$

The mean of the data is 35.375 years.


Comparison and Interpretation:

The mode is 36.82 years, which represents the age with the highest frequency of patients. This implies that the maximum number of patients admitted are around 37 years old.

The mean is 35.375 years, which represents the average age of all patients admitted to the hospital.

The two values are quite close. This indicates that the distribution of patient ages is nearly symmetrical. The mode being slightly higher than the mean suggests that while the average age is around 35.4, the most common age group is slightly older.

Question 2. The following data gives the information on the observed lifetimes (in hours) of 225 electrical components :

Lifetimes (in hours) 0 - 20 20 - 40 40 - 60 60 - 80 80 - 100 100 - 120
Frequency 10 35 52 61 38 29

Determine the modal lifetimes of the components.

Answer:

Given:

A frequency distribution table showing the lifetimes of 225 electrical components.

Lifetimes (in hours) 0 - 20 20 - 40 40 - 60 60 - 80 80 - 100 100 - 120
Frequency ($f_i$) 10 35 52 61 38 29

To Find:

The modal lifetimes of the components.


Solution:

The formula for the mode of grouped data is:

Mode $= l + \left(\frac{f_1 - f_0}{2f_1 - f_0 - f_2}\right) \times h$

First, we identify the modal class by finding the class with the highest frequency. The maximum frequency is 61, which corresponds to the class interval 60 - 80.

From the data, we have:

  • Lower limit of the modal class ($l$) = 60
  • Class size ($h$) = 20
  • Frequency of the modal class ($f_1$) = 61
  • Frequency of the preceding class ($f_0$) = 52
  • Frequency of the succeeding class ($f_2$) = 38

Substituting these values into the formula:

Mode $= 60 + \left(\frac{61 - 52}{2(61) - 52 - 38}\right) \times 20$

Mode $= 60 + \left(\frac{9}{122 - 90}\right) \times 20$

Mode $= 60 + \left(\frac{9}{32}\right) \times 20$

Mode $= 60 + \frac{180}{32} = 60 + \frac{45}{8}$

Mode $= 60 + 5.625$

Mode $= 65.625$

Therefore, the modal lifetime of the electrical components is 65.625 hours. This means the most common lifetime for these components is approximately 65.625 hours.

Question 3. The following data gives the distribution of total monthly household expenditure of 200 families of a village. Find the modal monthly expenditure of the families. Also, find the mean monthly expenditure :

Expenditure (in $\textsf{₹}$) Number of families
1000 - 1500 24
1500 - 2000 40
2000 - 2500 33
2500 - 3000 28
3000 - 3500 30
3500 - 4000 22
4000 - 4500 16
4500 - 5000 7

Answer:

Given:

A frequency distribution table of the monthly household expenditure of 200 families.


To Find:

The modal and mean monthly expenditure.


Solution:

Calculation for Mode

The formula for the mode is: Mode $= l + \left(\frac{f_1 - f_0}{2f_1 - f_0 - f_2}\right) \times h$

The highest frequency is 40, so the modal class is 1500 - 2000.

We have:

  • $l = 1500$
  • $h = 500$
  • $f_1 = 40$
  • $f_0 = 24$
  • $f_2 = 33$

Substituting the values:

Mode $= 1500 + \left(\frac{40 - 24}{2(40) - 24 - 33}\right) \times 500$

Mode $= 1500 + \left(\frac{16}{80 - 57}\right) \times 500$

Mode $= 1500 + \left(\frac{16}{23}\right) \times 500$

Mode $= 1500 + \frac{8000}{23} \approx 1500 + 347.83$

Mode $\approx 1847.83$

The modal monthly expenditure is $\textsf{₹}1847.83$.

Calculation for Mean

Using the Step-deviation method: Mean $(\bar{x}) = a + \left(\frac{\sum f_i u_i}{\sum f_i}\right) \times h$

Let the assumed mean ($a$) be 2750, and class size ($h$) is 500.

Expenditure (in $\textsf{₹}$) No. of families ($f_i$) Class Mark ($x_i$) $u_i = \frac{x_i - 2750}{500}$ $f_i u_i$
1000 - 1500241250-3-72
1500 - 2000401750-2-80
2000 - 2500332250-1-33
2500 - 300028275000
3000 - 3500303250130
3500 - 4000223750244
4000 - 4500164250348
4500 - 500074750428
Total $\sum f_i = 200$ $\sum f_i u_i = -35$

From the table, $\sum f_i = 200$ and $\sum f_i u_i = -185 + 150 = -35$.

Substituting the values:

$\bar{x} = 2750 + \left(\frac{-35}{200}\right) \times 500$

$\bar{x} = 2750 - \frac{35 \times 5}{2} = 2750 - \frac{175}{2}$

$\bar{x} = 2750 - 87.5$

$\bar{x} = 2662.5$

The mean monthly expenditure is $\textsf{₹}2662.50$.

Question 4. The following distribution gives the state-wise teacher-student ratio in higher secondary schools of India. Find the mode and mean of this data. Interpret the two measures

Number of students per teacher Number of States / U.T.
15 - 20 3
20 - 25 8
25 - 30 9
30 - 35 10
35 - 40 3
40 - 45 0
45 - 50 0
50 - 55 2

Answer:

Given:

A frequency distribution of the state-wise teacher-student ratio in higher secondary schools of India.


To Find:

The mode and mean of the data, and an interpretation of the results.


Solution:

Calculation for Mode

The mode formula is: Mode $= l + \left(\frac{f_1 - f_0}{2f_1 - f_0 - f_2}\right) \times h$

The highest frequency is 10, making the modal class 30 - 35.

We have:

  • $l = 30$
  • $h = 5$
  • $f_1 = 10$
  • $f_0 = 9$
  • $f_2 = 3$

Substituting the values:

Mode $= 30 + \left(\frac{10 - 9}{2(10) - 9 - 3}\right) \times 5$

Mode $= 30 + \left(\frac{1}{20 - 12}\right) \times 5 = 30 + \frac{5}{8}$

Mode $= 30 + 0.625 = 30.625$

The modal ratio is 30.625.

Calculation for Mean

Using the Step-deviation method: Mean $(\bar{x}) = a + \left(\frac{\sum f_i u_i}{\sum f_i}\right) \times h$

Let the assumed mean ($a$) be 32.5, and class size ($h$) is 5.

Students per Teacher No. of States/U.T. ($f_i$) Class Mark ($x_i$) $u_i = \frac{x_i - 32.5}{5}$ $f_i u_i$
15 - 20317.5-3-9
20 - 25822.5-2-16
25 - 30927.5-1-9
30 - 351032.500
35 - 40337.513
40 - 45042.520
45 - 50047.530
50 - 55252.548
Total $\sum f_i = 35$ $\sum f_i u_i = -23$

From the table, $\sum f_i = 35$ and $\sum f_i u_i = -34 + 11 = -23$.

Substituting the values:

$\bar{x} = 32.5 + \left(\frac{-23}{35}\right) \times 5$

$\bar{x} = 32.5 - \frac{23 \times \cancel{5}^1}{\cancel{35}_7} = 32.5 - \frac{23}{7}$

$\bar{x} \approx 32.5 - 3.29 = 29.21$

The mean ratio is approximately 29.21.


Interpretation:

The mode of 30.625 implies that the most common student-teacher ratio in the states/U.T. is approximately 30.6 students per teacher.

The mean of 29.21 indicates that, on average, there are about 29.2 students per teacher across all the states/U.T.

Question 5. The given distribution shows the number of runs scored by some top batsmen of the world in one-day international cricket matches.

Runs scored Number of batsmen
3000 - 4000 4
4000 - 5000 18
5000 - 6000 9
6000 - 7000 7
7000 - 8000 6
8000 - 9000 3
9000 - 10000 1
10000 - 11000 1

Find the mode of the data.

Answer:

Given:

A frequency distribution table of runs scored by top batsmen.


To Find:

The mode of the given data.


Solution:

The formula to find the mode of grouped data is:

Mode $= l + \left(\frac{f_1 - f_0}{2f_1 - f_0 - f_2}\right) \times h$

From the table, the maximum frequency is 18, which corresponds to the class interval 4000 - 5000. This is our modal class.

We have:

  • Lower limit of the modal class ($l$) = 4000
  • Class size ($h$) = 1000
  • Frequency of the modal class ($f_1$) = 18
  • Frequency of the preceding class ($f_0$) = 4
  • Frequency of the succeeding class ($f_2$) = 9

Substituting these values into the formula:

Mode $= 4000 + \left(\frac{18 - 4}{2(18) - 4 - 9}\right) \times 1000$

Mode $= 4000 + \left(\frac{14}{36 - 13}\right) \times 1000$

Mode $= 4000 + \left(\frac{14}{23}\right) \times 1000$

Mode $= 4000 + \frac{14000}{23} \approx 4000 + 608.70$

Mode $\approx 4608.70$

The mode of the data is approximately 4608.7 runs, indicating the most common range of scores for top batsmen.

Question 6. A student noted the number of cars passing through a spot on a road for 100 periods each of 3 minutes and summarised it in the table given below. Find the mode of the data :

Number of cars 0 - 10 10 - 20 20 - 30 30 - 40 40 - 50 50 - 60 60 - 70 70 - 80
Frequency 7 14 13 12 20 11 15 8

Answer:

Given:

A frequency distribution of the number of cars passing a spot in 100 periods.


To Find:

The mode of the data.


Solution:

The formula to find the mode of grouped data is:

Mode $= l + \left(\frac{f_1 - f_0}{2f_1 - f_0 - f_2}\right) \times h$

The highest frequency in the data is 20, which corresponds to the class interval 40 - 50. This is the modal class.

From the data, we have:

  • Lower limit of the modal class ($l$) = 40
  • Class size ($h$) = 10
  • Frequency of the modal class ($f_1$) = 20
  • Frequency of the preceding class ($f_0$) = 12
  • Frequency of the succeeding class ($f_2$) = 11

Substituting these values into the formula:

Mode $= 40 + \left(\frac{20 - 12}{2(20) - 12 - 11}\right) \times 10$

Mode $= 40 + \left(\frac{8}{40 - 23}\right) \times 10$

Mode $= 40 + \left(\frac{8}{17}\right) \times 10$

Mode $= 40 + \frac{80}{17} \approx 40 + 4.71$

Mode $\approx 44.71$

Therefore, the mode of the data is approximately 44.71 cars. This represents the most frequent number of cars passing the spot during a 3-minute period.



Example 7 & 8 (Before Exercise 14.3)

Example 7. A survey regarding the heights (in cm) of 51 girls of Class X of a school was conducted and the following data was obtained:

Height (in cm) Number of girls
Less than 140 4
Less than 145 11
Less than 150 29
Less than 155 40
Less than 160 46
Less than 165 51

Find the median height

Answer:

Given:

A 'less than' type cumulative frequency distribution of the heights of 51 girls.


To Find:

The median height of the girls.


Solution:

Step 1: Convert the cumulative frequency distribution to a standard frequency distribution.

The given data is of 'less than' type. We need to create class intervals and find the frequency for each class. The class size is constant ($145-140=5$).

Height (in cm) Number of girls (Frequency $f_i$) Cumulative Frequency (CF)
135 - 14044
140 - 145$11 - 4 = 7$11
145 - 150$29 - 11 = 18$29
150 - 155$40 - 29 = 11$40
155 - 160$46 - 40 = 6$46
160 - 165$51 - 46 = 5$51
Total $\sum f_i = N = 51$

Step 2: Identify the median class.

The total number of observations is $N = 51$.

We need to find the value of $\frac{N}{2} = \frac{51}{2} = 25.5$.

Now, we locate the class whose cumulative frequency is just greater than or equal to 25.5. From the table, this cumulative frequency is 29, which corresponds to the class interval 145 - 150.

Therefore, the median class is $145 - 150$.

Step 3: Calculate the median using the formula.

The formula for the median is:

Median $= l + \left(\frac{\frac{N}{2} - CF}{f}\right) \times h$

Here,

  • $l$ = Lower limit of the median class = $145$
  • $N$ = Total frequency = $51$
  • $CF$ = Cumulative frequency of the class preceding the median class = $11$
  • $f$ = Frequency of the median class = $18$
  • $h$ = Class size = $5$

Substituting these values into the formula:

Median $= 145 + \left(\frac{25.5 - 11}{18}\right) \times 5$

Median $= 145 + \left(\frac{14.5}{18}\right) \times 5$

Median $= 145 + \frac{72.5}{18}$

Median $\approx 145 + 4.03$

Median $\approx 149.03$

Hence, the median height of the girls is approximately 149.03 cm.

Example 8. The median of the following data is 525. Find the values of x and y, if the total frequency is 100.

Class interval Frequency
0 - 100 2
100 - 200 5
200 - 300 x
300 - 400 12
400 - 500 17
500 - 600 20
600 - 700 y
700 - 800 9
800 - 900 7
900 - 1000 4

Answer:

Given:

  • A grouped frequency distribution with two unknown frequencies, x and y.
  • The median of the distribution is 525.
  • The total frequency ($N$) is 100.

To Find:

The values of x and y.


Solution:

Step 1: Construct the cumulative frequency table and form the first equation.

We are given that the sum of all frequencies is 100.

Class Interval Frequency ($f_i$) Cumulative Frequency (CF)
0 - 10022
100 - 20057
200 - 300x$7 + x$
300 - 40012$19 + x$
400 - 50017$36 + x$
500 - 60020$56 + x$
600 - 700y$56 + x + y$
700 - 8009$65 + x + y$
800 - 9007$72 + x + y$
900 - 10004$76 + x + y$

The sum of frequencies is $2+5+x+12+17+20+y+9+7+4 = 76 + x + y$.

Since the total frequency is 100, we have:

$76 + x + y = 100$

$x + y = 100 - 76$

$x + y = 24$

... (i)

Step 2: Identify the median class and apply the median formula to find x.

The given median is 525. This value lies in the class interval 500 - 600. So, this is our median class.

The formula for the median is: Median $= l + \left(\frac{\frac{N}{2} - CF}{f}\right) \times h$

Here,

  • $l$ = Lower limit of the median class = $500$
  • $N$ = Total frequency = $100$, so $\frac{N}{2} = 50$
  • $CF$ = Cumulative frequency of the class preceding the median class = $36 + x$
  • $f$ = Frequency of the median class = $20$
  • $h$ = Class size = $100$

Substituting these values into the formula:

$525 = 500 + \left(\frac{50 - (36 + x)}{20}\right) \times 100$

$525 - 500 = \left(\frac{50 - 36 - x}{20}\right) \times 100$

$25 = \left(\frac{14 - x}{\cancel{20}_1}\right) \times \cancel{100}^5$

$25 = (14 - x) \times 5$

$\frac{25}{5} = 14 - x$

$5 = 14 - x$

$x = 14 - 5$

$x = 9$

Step 3: Find the value of y.

Substitute the value of $x = 9$ into equation (i):

$9 + y = 24$

$y = 24 - 9$

$y = 15$

Hence, the values of the missing frequencies are x = 9 and y = 15.



Exercise 14.3

Question 1. The following frequency distribution gives the monthly consumption of electricity of 68 consumers of a locality. Find the median, mean and mode of the data and compare them.

Monthly consumption (in units) Number of consumers
65 - 85 4
85 - 105 5
105 - 125 13
125 - 145 20
145 - 165 14
165 - 185 8
185 - 205 4

Answer:

Given:

A frequency distribution of the monthly electricity consumption of 68 consumers.


To Find:

The median, mean, and mode of the data and to compare them.


Solution:

First, we construct a comprehensive table with all necessary calculations for mean, median, and mode.

Let the assumed mean, $a = 135$. Class size, $h = 20$.

Monthly Consumption (units) No. of Consumers ($f_i$) Class Mark ($x_i$) Cumulative Frequency (CF) $u_i = \frac{x_i - 135}{20}$ $f_i u_i$
65 - 854754-3-12
85 - 1055959-2-10
105 - 1251311522-1-13
125 - 145201354200
145 - 1651415556114
165 - 185817564216
185 - 205419568312
Total $N=\sum f_i = 68$ $\sum f_i u_i = 7$

Calculation for Median

Here, $N = 68$, so $\frac{N}{2} = \frac{68}{2} = 34$.

The cumulative frequency just greater than 34 is 42, which corresponds to the class interval 125 - 145. This is the median class.

Median $= l + \left(\frac{\frac{N}{2} - CF}{f}\right) \times h$

Here, $l=125$, $CF=22$, $f=20$, $h=20$.

Median $= 125 + \left(\frac{34 - 22}{20}\right) \times 20 = 125 + \frac{12}{20} \times 20 = 125 + 12 = 137$.

Median = 137 units.


Calculation for Mean

Using the step-deviation method: Mean $(\bar{x}) = a + \left(\frac{\sum f_i u_i}{\sum f_i}\right) \times h$.

From the table, $\sum f_i u_i = 7$ and $\sum f_i = 68$.

$\bar{x} = 135 + \left(\frac{7}{68}\right) \times 20 = 135 + \frac{140}{68} = 135 + \frac{35}{17} \approx 135 + 2.059$.

Mean $\approx 137.06$ units.


Calculation for Mode

The highest frequency is 20, so the modal class is 125 - 145.

Mode $= l + \left(\frac{f_1 - f_0}{2f_1 - f_0 - f_2}\right) \times h$

Here, $l=125$, $f_1=20$, $f_0=13$, $f_2=14$, $h=20$.

Mode $= 125 + \left(\frac{20 - 13}{2(20) - 13 - 14}\right) \times 20 = 125 + \left(\frac{7}{40 - 27}\right) \times 20 = 125 + \frac{140}{13} \ $$ \approx 125 + 10.77$.

Mode $\approx 135.77$ units.


Comparison of Results:

  • Mean $\approx 137.06$
  • Median $= 137$
  • Mode $\approx 135.77$

The three measures of central tendency are very close to each other. This indicates that the data distribution is nearly symmetrical and bell-shaped, with a central value around 137 units.

Question 2. If the median of the distribution given below is 28.5, find the values of x and y.

Class interval Frequency
0 - 10 5
10 - 20 x
20 - 30 20
30 - 40 15
40 - 50 y
50 - 60 5
Total 60

Answer:

Given:

  • A frequency distribution with two unknown frequencies, x and y.
  • The median of the distribution is 28.5.
  • The total frequency, $N = 60$.

To Find:

The values of x and y.


Solution:

Since there are two unknown variables, we need to form two independent equations to solve for them.

Step 1: Form the first equation using the total frequency.

We first construct the cumulative frequency (CF) table.

Class Interval Frequency ($f_i$) Cumulative Frequency (CF)
0 - 1055
10 - 20x$5 + x$
20 - 3020$25 + x$
30 - 4015$40 + x$
40 - 50y$40 + x + y$
50 - 605$45 + x + y$

The sum of all frequencies is given as 60. From the table, this sum is $45 + x + y$.

$45 + x + y = 60$

$x + y = 60 - 45$

$x + y = 15$

... (i)

Step 2: Form the second equation using the given median.

The median is given as 28.5, which lies in the class interval 20 - 30. So, this is the median class.

The median formula is: Median $= l + \left(\frac{\frac{N}{2} - CF}{f}\right) \times h$

Here, $l=20$, $N=60$ (so $\frac{N}{2}=30$), $CF = 5+x$ (CF of the preceding class), $f=20$, and $h=10$.

Substituting these values:

$28.5 = 20 + \left(\frac{30 - (5 + x)}{20}\right) \times 10$

$28.5 - 20 = \left(\frac{25 - x}{20}\right) \times 10$

$8.5 = \frac{25 - x}{2}$

$8.5 \times 2 = 25 - x$

$17 = 25 - x$

$x = 25 - 17 \implies x = 8$.

Step 3: Solve for y.

Substitute $x=8$ into equation (i):

$8 + y = 15$

$y = 15 - 8 \implies y = 7$.


The values of the missing frequencies are x = 8 and y = 7.

Question 3. A life insurance agent found the following data for distribution of ages of 100 policy holders. Calculate the median age, if policies are given only to persons having age 18 years onwards but less than 60 year.

Age (in years) Number of policy holders
Below 20 2
Below 25 6
Below 30 24
Below 35 45
Below 40 78
Below 45 89
Below 50 92
Below 55 98
Below 60 100

Answer:

Given:

A 'less than' type cumulative frequency distribution of the ages of 100 policy holders, aged 18 years and above.


To Find:

The median age of the policy holders.


Solution:

Step 1: Convert the given data into a frequency distribution table.

The given data is cumulative. We need to find the frequency of each class interval. Since policies start at age 18, the first class interval will be 18-20.

Age (in years) Number of policy holders (Frequency $f_i$) Cumulative Frequency (CF)
18 - 2022
20 - 25$6 - 2 = 4$6
25 - 30$24 - 6 = 18$24
30 - 35$45 - 24 = 21$45
35 - 40$78 - 45 = 33$78
40 - 45$89 - 78 = 11$89
45 - 50$92 - 89 = 3$92
50 - 55$98 - 92 = 6$98
55 - 60$100 - 98 = 2$100
Total $N = 100$

Step 2: Identify the median class.

Here, $N = 100$, so $\frac{N}{2} = \frac{100}{2} = 50$.

The cumulative frequency just greater than 50 is 78, which corresponds to the class interval 35 - 40. This is the median class.

Step 3: Calculate the median.

Median $= l + \left(\frac{\frac{N}{2} - CF}{f}\right) \times h$

Here, $l=35$, $CF=45$ (CF of the preceding class), $f=33$, and $h=5$.

Median $= 35 + \left(\frac{50 - 45}{33}\right) \times 5$

Median $= 35 + \frac{5}{33} \times 5 = 35 + \frac{25}{33}$

Median $\approx 35 + 0.7576$

Median $\approx 35.76$

The median age of the policy holders is approximately 35.76 years.

Question 4. The lengths of 40 leaves of a plant are measured correct to the nearest millimetre, and the data obtained is represented in the following table :

Length (in mm) Number of leaves
118 - 126 3
127 - 135 5
136 - 144 9
145 - 153 12
154 - 162 5
163 - 171 4
172 - 180 2

Find the median length of the leaves.

(Hint: The data needs to be converted to continuous classes for finding the median, since the formula assumes continuous classes. The classes then change to 117.5 - 126.5, 126.5 - 135.5, . . ., 171.5 - 180.5.)

Answer:

Given:

A frequency distribution of the lengths of 40 leaves with discontinuous class intervals.


To Find:

The median length of the leaves.


Solution:

Step 1: Convert the discontinuous classes to continuous classes.

The median formula requires continuous class intervals. There is a gap of 1 mm between the upper limit of one class and the lower limit of the next (e.g., 126 and 127). The adjustment factor is half the gap, i.e., $\frac{1}{2} = 0.5$. We subtract 0.5 from each lower limit and add 0.5 to each upper limit.

Length (in mm) No. of Leaves ($f_i$) Cumulative Frequency (CF)
117.5 - 126.533
126.5 - 135.558
135.5 - 144.5917
144.5 - 153.51229
153.5 - 162.5534
162.5 - 171.5438
171.5 - 180.5240
Total $N = 40$

Step 2: Identify the median class.

Here, $N = 40$, so $\frac{N}{2} = \frac{40}{2} = 20$.

The cumulative frequency just greater than 20 is 29, corresponding to the class 144.5 - 153.5. This is the median class.

Step 3: Calculate the median.

Median $= l + \left(\frac{\frac{N}{2} - CF}{f}\right) \times h$

Here, $l=144.5$, $CF=17$, $f=12$, and the class size $h = 153.5 - 144.5 = 9$.

Median $= 144.5 + \left(\frac{20 - 17}{12}\right) \times 9$

Median $= 144.5 + \left(\frac{3}{12}\right) \times 9 = 144.5 + \frac{1}{4} \times 9$

Median $= 144.5 + \frac{9}{4} = 144.5 + 2.25$

Median $= 146.75$

The median length of the leaves is 146.75 mm.

Question 5. The following table gives the distribution of the life time of 400 neon lamps :

Life time (in hours) Number of lamps
1500 - 2000 14
2000 - 2500 56
2500 - 3000 60
3000 - 3500 86
3500 - 4000 74
4000 - 4500 62
4500 - 5000 48

Find the median life time of a lamp.

Answer:

Given:

The frequency distribution of the lifetime of 400 neon lamps.


To Find:

The median lifetime of a lamp.


Solution:

Step 1: Construct the cumulative frequency (CF) table.

Life time (in hours) No. of lamps ($f_i$) Cumulative Frequency (CF)
1500 - 20001414
2000 - 25005670
2500 - 300060130
3000 - 350086216
3500 - 400074290
4000 - 450062352
4500 - 500048400
Total $N = 400$

Step 2: Identify the median class.

Total frequency $N = 400$, so $\frac{N}{2} = \frac{400}{2} = 200$.

The cumulative frequency just greater than 200 is 216, which corresponds to the class interval 3000 - 3500. This is the median class.

Step 3: Calculate the median.

Median $= l + \left(\frac{\frac{N}{2} - CF}{f}\right) \times h$

Here, $l=3000$, $CF=130$, $f=86$, and $h=500$.

Median $= 3000 + \left(\frac{200 - 130}{86}\right) \times 500$

Median $= 3000 + \left(\frac{70}{86}\right) \times 500 = 3000 + \frac{35000}{86}$

Median $\approx 3000 + 406.98$

Median $\approx 3406.98$

The median lifetime of a lamp is approximately 3406.98 hours.

Question 6. 100 surnames were randomly picked up from a local telephone directory and the frequency distribution of the number of letters in the English alphabets in the surnames was obtained as follows:

Number of letters 1 - 4 4 - 7 7 - 10 10 - 13 13 - 16 16 - 19
Number of surnames 6 30 40 16 4 4

Determine the median number of letters in the surnames. Find the mean number of letters in the surnames? Also, find the modal size of the surnames.

Answer:

Given:

A frequency distribution of the number of letters in 100 surnames.


To Find:

The median, mean, and mode for the number of letters in the surnames.


Solution:

First, we create a master table for all necessary calculations. Let the assumed mean, $a = 8.5$. Class size, $h = 3$.

No. of Letters No. of Surnames ($f_i$) Class Mark ($x_i$) CF $u_i = \frac{x_i - 8.5}{3}$ $f_i u_i$
1 - 462.56-2-12
4 - 7305.536-1-30
7 - 10408.57600
10 - 131611.592116
13 - 16414.59628
16 - 19417.5100312
Total $N = 100$ $\sum f_i u_i = -6$

Median Calculation

Here, $N = 100$, so $\frac{N}{2} = 50$. The CF just greater than 50 is 76, so the median class is 7 - 10.

Median $= l + \left(\frac{\frac{N}{2} - CF}{f}\right) \times h = 7 + \left(\frac{50 - 36}{40}\right) \times 3 = 7 + \frac{14 \times 3}{40} \ $$ = 7 + \frac{42}{40} \ $$ = 7 + 1.05$.

Median = 8.05 letters.


Mean Calculation

Mean $(\bar{x}) = a + \left(\frac{\sum f_i u_i}{\sum f_i}\right) \times h = 8.5 + \left(\frac{-6}{100}\right) \times 3 = 8.5 - \frac{18}{100} \ $$ = 8.5 - 0.18$.

Mean = 8.32 letters.


Mode Calculation

The highest frequency is 40, so the modal class is 7 - 10.

Mode $= l + \left(\frac{f_1 - f_0}{2f_1 - f_0 - f_2}\right) \times h = 7 + \left(\frac{40 - 30}{2(40) - 30 - 16}\right) \times 3 \ $$ = 7 + \left(\frac{10}{80 - 46}\right) \times 3 \ $$ = 7 + \frac{30}{34} \approx 7 + 0.88$.

Mode $\approx 7.88$ letters.

Question 7. The distribution below gives the weights of 30 students of a class. Find the median weight of the students

Weight (in kg) 40 - 45 45 - 50 50 - 55 55 - 60 60 - 65 65 - 70 70 - 75
Number of students 2 3 8 6 6 3 2

Answer:

Given:

The frequency distribution of the weights of 30 students.


To Find:

The median weight of the students.


Solution:

Step 1: Create the cumulative frequency (CF) table.

Weight (in kg) No. of students ($f_i$) Cumulative Frequency (CF)
40 - 4522
45 - 5035
50 - 55813
55 - 60619
60 - 65625
65 - 70328
70 - 75230
Total $N = 30$

Step 2: Identify the median class.

Total frequency $N = 30$, so $\frac{N}{2} = \frac{30}{2} = 15$.

The cumulative frequency just greater than 15 is 19, which corresponds to the class interval 55 - 60. This is the median class.

Step 3: Calculate the median.

Median $= l + \left(\frac{\frac{N}{2} - CF}{f}\right) \times h$

Here, $l=55$, $CF=13$, $f=6$, and $h=5$.

Median $= 55 + \left(\frac{15 - 13}{6}\right) \times 5$

Median $= 55 + \frac{2}{6} \times 5 = 55 + \frac{1}{3} \times 5$

Median $= 55 + \frac{5}{3} \approx 55 + 1.67$

Median $\approx 56.67$

The median weight of the students is approximately 56.67 kg.



Example 9 (Before Exercise 14.4)

Example 9. The annual profits earned by 30 shops of a shopping complex in a locality give rise to the following distribution :

Profit (in lakhs of $\textsf{₹}$) More than or equal to Number of shops (cumulative frequency)
More than or equal to 530
More than or equal to 1028
More than or equal to 1516
More than or equal to 2014
More than or equal to 2510
More than or equal to 307
More than or equal to 353

Draw both ogives for the data above. Hence obtain the median profit.

Answer:

Given:

A 'more than or equal to' type cumulative frequency distribution of the annual profits of 30 shops.


To Do:

Draw both 'less than' and 'more than' ogives for the data and find the median profit from the graph.


Solution:

Step 1: Convert the given data into a frequency distribution and prepare data for both ogives.

We first construct a frequency distribution table from the 'more than' data. Then, we find the 'less than' cumulative frequencies.

Profit (in lakhs of ₹) Frequency ($f_i$) 'Less than' Cumulative Frequency 'More than' Cumulative Frequency
5 - 10$30 - 28 = 2$230
10 - 15$28 - 16 = 12$1428
15 - 20$16 - 14 = 2$1616
20 - 25$14 - 10 = 4$2014
25 - 30$10 - 7 = 3$2310
30 - 35$7 - 3 = 4$277
35 - 40$3 - 0 = 3$303
Total $N=30$

Step 2: Determine the points for plotting the ogives.

For the 'Less than' Ogive, we plot the upper class limits against their corresponding cumulative frequencies:

Points: (10, 2), (15, 14), (20, 16), (25, 20), (30, 23), (35, 27), (40, 30). We also add the point (5, 0).

For the 'More than' Ogive, we plot the lower class limits against their corresponding cumulative frequencies:

Points: (5, 30), (10, 28), (15, 16), (20, 14), (25, 10), (30, 7), (35, 3).

Step 3: Draw the ogives and find the median.

We plot these points on a graph with Profit (in lakhs of ₹) on the x-axis and the Number of Shops (cumulative frequency) on the y-axis. We then join the points for each ogive with a smooth curve.

Less than and More than Ogives intersecting to find the median.

The two ogives intersect at a point. We draw a perpendicular from this intersection point to the x-axis. The x-coordinate of this point gives the median of the data.

From the graph, the intersection point is approximately (17.5, 15).

The x-coordinate is 17.5.

Hence, the median profit is $\textsf{₹}17.5$ lakhs.


Verification by Formula

Total frequency $N = 30$, so $\frac{N}{2} = 15$.

From the 'less than' CF column, the CF just greater than 15 is 16. The corresponding class is 15 - 20, which is the median class.

Median $= l + \left(\frac{\frac{N}{2} - CF}{f}\right) \times h$

Here, $l=15$, $CF=14$, $f=2$, and $h=5$.

Median $= 15 + \left(\frac{15 - 14}{2}\right) \times 5 = 15 + \frac{1}{2} \times 5 = 15 + 2.5 = 17.5$.

The calculated median matches the value obtained from the graph.



Exercise 14.4

Question 1. The following distribution gives the daily income of 50 workers of a factory

Daily income (in $\textsf{₹}$) 100 - 120 120 - 140 140 - 160 160 - 180 180 - 200
Number of workers 12 14 8 6 10

Convert the distribution above to a less than type cumulative frequency distribution, and draw its ogive.

Answer:

Given:

A frequency distribution of the daily income of 50 factory workers.


To Do:

1. Convert the distribution to a 'less than' type cumulative frequency distribution.

2. Draw the 'less than' ogive for this data.


Solution:

Step 1: Create the 'less than' type cumulative frequency distribution.

We use the upper limits of the class intervals to define the 'less than' categories and calculate the cumulative frequencies by successively adding the frequencies.

Daily income (in $\textsf{₹}$) Number of workers (Cumulative Frequency) Coordinates for Ogive (Upper Limit, CF)
Less than 12012(120, 12)
Less than 140$12 + 14 = 26$(140, 26)
Less than 160$26 + 8 = 34$(160, 34)
Less than 180$34 + 6 = 40$(180, 40)
Less than 200$40 + 10 = 50$(200, 50)

Step 2: Draw the 'less than' ogive.

To draw the ogive, we plot the coordinates derived in the table on a graph. The x-axis represents the Daily Income (in $\textsf{₹}$), and the y-axis represents the Number of Workers (Cumulative Frequency).

The points to be plotted are: (120, 12), (140, 26), (160, 34), (180, 40), (200, 50).

We also add the point (100, 0) corresponding to the lower limit of the first class interval, as there are no workers with income less than $\textsf{₹}100$.

By plotting these points and joining them with a smooth freehand curve, we obtain the 'less than' ogive.

Less than ogive for the daily income of workers.

Question 2. During the medical check-up of 35 students of a class, their weights were recorded as follows:

Weight (in kg) Number of students
Less than 38 0
Less than 40 3
Less than 42 5
Less than 44 9
Less than 46 14
Less than 48 28
Less than 50 32
Less than 52 35

Draw a less than type ogive for the given data. Hence obtain the median weight from the graph and verify the result by using the formula.

Answer:

Given:

A 'less than' type cumulative frequency distribution of the weights of 35 students.


To Do:

1. Draw a 'less than' type ogive for the data.

2. Obtain the median weight from the graph.

3. Verify the result using the median formula.


Solution:

Step 1: Draw the 'less than' ogive.

The data is already in the required format. The points for the ogive are the upper limits of weight and the corresponding cumulative frequencies.

Points to plot: (38, 0), (40, 3), (42, 5), (44, 9), (46, 14), (48, 28), (50, 32), (52, 35).

Plot these points on a graph with Weight (in kg) on the x-axis and Number of Students (CF) on the y-axis. Join them with a smooth curve to get the ogive.

Less than ogive for the weights of students.

Step 2: Obtain the median from the graph.

Total number of students, $N = 35$. The median position is $\frac{N}{2} = \frac{35}{2} = 17.5$.

On the graph, locate 17.5 on the y-axis. Draw a horizontal line from this point to intersect the ogive. From the intersection point, draw a perpendicular to the x-axis. The point where this perpendicular meets the x-axis is the median.

From the graph, this value is found to be 46.5 kg.


Step 3: Verify the median using the formula.

First, we create a standard frequency distribution from the given cumulative data.

Weight (in kg) Frequency ($f_i$) Cumulative Frequency (CF)
Below 3800
38 - 40$3 - 0 = 3$3
40 - 42$5 - 3 = 2$5
42 - 44$9 - 5 = 4$9
44 - 46$14 - 9 = 5$14
46 - 48$28 - 14 = 14$28
48 - 50$32 - 28 = 4$32
50 - 52$35 - 32 = 3$35

Since $\frac{N}{2} = 17.5$, the CF just greater than 17.5 is 28. The corresponding class, 46 - 48, is the median class.

Using the formula: Median $= l + \left(\frac{\frac{N}{2} - CF}{f}\right) \times h$

Here, $l=46$, $CF=14$, $f=14$, and $h=2$.

Median $= 46 + \left(\frac{17.5 - 14}{14}\right) \times 2 = 46 + \frac{3.5}{14} \times 2 = 46 + \frac{7}{14} \ $$ = 46 + 0.5 \ $$ = 46.5$.

The median weight calculated using the formula is 46.5 kg, which successfully verifies the result obtained from the graph.

Question 3. The following table gives production yield per hectare of wheat of 100 farms of a village.

Production yield (in kg/ha) 50 - 55 55 - 60 60 - 65 65 - 70 70 - 75 75 - 80
Number of farms 2 8 12 24 38 16

Change the distribution to a more than type distribution, and draw its ogive.

Answer:

Given:

A frequency distribution of the production yield for 100 farms.


To Do:

1. Change the distribution to a 'more than' type distribution.

2. Draw its 'more than' ogive.


Solution:

Step 1: Create the 'more than' type cumulative frequency distribution.

We use the lower limits of the class intervals and calculate the cumulative frequency by starting with the total frequency (100) and successively subtracting the frequencies of the preceding classes.

Production yield (in kg/ha) Number of farms (Cumulative Frequency) Coordinates for Ogive (Lower Limit, CF)
More than or equal to 50100(50, 100)
More than or equal to 55$100 - 2 = 98$(55, 98)
More than or equal to 60$98 - 8 = 90$(60, 90)
More than or equal to 65$90 - 12 = 78$(65, 78)
More than or equal to 70$78 - 24 = 54$(70, 54)
More than or equal to 75$54 - 38 = 16$(75, 16)

Step 2: Draw the 'more than' ogive.

To draw the 'more than' ogive, we plot the coordinates derived above on a graph. The x-axis represents the Production Yield (in kg/ha), and the y-axis represents the Number of Farms (Cumulative Frequency).

The points to be plotted are: (50, 100), (55, 98), (60, 90), (65, 78), (70, 54), (75, 16).

By plotting these points and joining them with a smooth freehand curve, we obtain the 'more than' ogive.

More than ogive for the production yield of farms.